Awni00's group workspace
Abstractor - L=2, d=128, h=8
Tags
driven-sky-10
Notes
Author
State
Finished
Start time
August 2nd, 2024 4:51:44 PM
Runtime
9h 48m 26s
Tracked hours
-
Run path
dual-attention/dual_attention--math--algebra__sequence_next_term/9z18jxwl
OS
Linux-4.18.0-477.51.1.el8_8.x86_64-x86_64-with-glibc2.28
Python version
3.11.7
Git repository
git clone https://www.github.com/awni00/abstract_transformer
Git state
git checkout -b "driven-sky-10" 9ce16be9a1cd1fba943e52654fc94c965461f2e0
Command
/gpfs/radev/project/lafferty/ma2393/abstract_transformer/experiments/math/train_abstractor_model.py --task algebra__sequence_next_term --n_epochs 100 --batch_size 512 --d_model 128 --dff 256 --symbol_type symbolic_attention --n_layers 2 --n_heads 8
System Hardware
| CPU count | 32 |
| Logical CPU count | 32 |
| GPU count | 1 |
| GPU type | NVIDIA A40 |
W&B CLI Version
0.17.5
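The flags in the Command entry above can be read as a flat set of hyperparameters. The following is a minimal sketch of how such a command line might be parsed with `argparse`. The parser below is hypothetical and is not taken from `train_abstractor_model.py`: only the flag names and values come from the recorded command, while the types and the `build_parser` helper are assumptions.

```python
import argparse
import shlex


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags seen in the recorded command."""
    parser = argparse.ArgumentParser(description="Sketch of the run's CLI flags")
    # Flag names are copied from the command; the types are assumed.
    parser.add_argument("--task", type=str, required=True)
    parser.add_argument("--n_epochs", type=int, required=True)
    parser.add_argument("--batch_size", type=int, required=True)
    parser.add_argument("--d_model", type=int, required=True)
    parser.add_argument("--dff", type=int, required=True)
    parser.add_argument("--symbol_type", type=str, required=True)
    parser.add_argument("--n_layers", type=int, required=True)
    parser.add_argument("--n_heads", type=int, required=True)
    return parser


# Argument string copied from the Command entry (script path omitted).
COMMAND_ARGS = (
    "--task algebra__sequence_next_term --n_epochs 100 --batch_size 512 "
    "--d_model 128 --dff 256 --symbol_type symbolic_attention "
    "--n_layers 2 --n_heads 8"
)

args = build_parser().parse_args(shlex.split(COMMAND_ARGS))
config = vars(args)  # plain dict, convenient for logging or comparison

for name, value in config.items():
    print(f"{name} = {value}")
```

The parsed values agree with the run name "Abstractor - L=2, d=128, h=8": `n_layers` is 2, `d_model` is 128, and `n_heads` is 8.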
Config
Config parameters are your model's inputs.
- {} 12 keys
- {} 12 keys
- 128
- {} 7 keys
- {} 6 keys
- "Abstractor - L=2, d=128, h=8"
- 161
- {} 2 keys
- "token"
- 85
- 2
- 2
- 31
- 85
- {} 2 keys
- "token"
- 85
Summary
Summary metrics are your model's outputs.
- {} 10 keys
- 99
- 0.03653674200177193
- 1.1599539518356323
- 0.9533318281173706
- 0.03653674200177193
- 1.1599539518356323
- 0.9533318281173706
- 0.0160044115036726
- 0.9698988795280457
- 390,599
Artifact Outputs
This run produced these artifacts as outputs. Total: 1.